Overview

Dataset info

Number of variables29
Number of observations151924
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory33.6 MiB
Average record size in memory232.0 B

Variables types

Numeric24
Categorical3
Boolean1
Date0
URL0
Text (Unique)0
Rejected1
Unsupported0

Warnings

co-borrower_credit_score has 61734 (40.6%) zeros Zeros
insurance_percent has 134211 (88.3%) zeros Zeros
m10 is highly skewed (γ1 = 39.16526857) Skewed
m10 has 151351 (99.6%) zeros Zeros
m11 is highly skewed (γ1 = 38.84844958) Skewed
m11 has 151353 (99.6%) zeros Zeros
m12 is highly skewed (γ1 = 37.68468478) Skewed
m12 has 151281 (99.6%) zeros Zeros
m2 is highly skewed (γ1 = 32.95069434) Skewed
m2 has 151642 (99.8%) zeros Zeros
m3 is highly skewed (γ1 = 42.90379644) Skewed
m3 has 151676 (99.8%) zeros Zeros
m4 is highly skewed (γ1 = 45.46357027) Skewed
m4 has 151668 (99.8%) zeros Zeros
m5 is highly skewed (γ1 = 39.48756787) Skewed
m5 has 151542 (99.7%) zeros Zeros
m6 is highly skewed (γ1 = 42.22134397) Skewed
m6 has 151587 (99.8%) zeros Zeros
m7 is highly skewed (γ1 = 41.98089514) Skewed
m7 has 151510 (99.7%) zeros Zeros
m8 is highly skewed (γ1 = 40.26684127) Skewed
m8 has 151478 (99.7%) zeros Zeros
m9 is highly skewed (γ1 = 41.08384865) Skewed
m9 has 151447 (99.7%) zeros Zeros
number_of_borrowers is highly correlated with co-borrower_credit_score (ρ = 0.9965738849) Rejected
origination_date has 15051 (9.9%) zeros Zeros

Variables

borrower_credit_score
Numeric

Distinct count223
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean769.9267134
Minimum0
Maximum840
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile689
Q1751
Median782
Q3799
95-th percentile813
Maximum840
Range840
Interquartile range48

Descriptive statistics

Standard deviation42.10920696
Coef of variation0.05469248726
Kurtosis47.69930716
Mean769.9267134
MAD31.15845887
Skewness-3.526468102
Sum116970346
Variance1773.185311
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 240. 619.5 629.5 639.5 ... 828.5 829.5 831.5 832.5 840. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
801 3025 2.0%
 
809 2965 2.0%
 
802 2677 1.8%
 
808 2645 1.7%
 
791 2610 1.7%
 
800 2424 1.6%
 
797 2420 1.6%
 
790 2356 1.6%
 
798 2354 1.5%
 
804 2340 1.5%
 
Other values (213) 126108 83.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 65 < 0.1%
 
480 1 < 0.1%
 
559 1 < 0.1%
 
619 1 < 0.1%
 
620 29 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
840 1 < 0.1%
 
839 1 < 0.1%
 
838 1 < 0.1%
 
835 1 < 0.1%
 
834 5 < 0.1%
 

co-borrower_credit_score
Numeric

Distinct count216
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean460.2785143
Minimum0
Maximum836
Zeros (%)40.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median741
Q3791
95-th percentile812
Maximum836
Range836
Interquartile range791

Descriptive statistics

Standard deviation381.7984428
Coef of variation0.8294943843
Kurtosis-1.847146868
Mean460.2785143
MAD374.0664253
Skewness-0.3661982584
Sum69927353
Variance145770.0509
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 310. 620.5 631.5 642.5 ... 822.5 823.5 826.5 829.5 836. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 61734 40.6%
 
801 2000 1.3%
 
809 1982 1.3%
 
808 1916 1.3%
 
802 1773 1.2%
 
791 1715 1.1%
 
797 1621 1.1%
 
790 1599 1.1%
 
799 1572 1.0%
 
796 1542 1.0%
 
Other values (206) 74470 49.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 61734 40.6%
 
620 9 < 0.1%
 
621 12 < 0.1%
 
622 10 < 0.1%
 
623 13 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
836 1 < 0.1%
 
834 2 < 0.1%
 
832 7 < 0.1%
 
831 2 < 0.1%
 
830 6 < 0.1%
 

debt_to_income_ratio
Numeric

Distinct count58
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean30.74714989
Minimum1
Maximum64
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile14
Q123
Median31
Q339
95-th percentile45
Maximum64
Range63
Interquartile range16

Descriptive statistics

Standard deviation9.729671905
Coef of variation0.3164414243
Kurtosis-0.8250455102
Mean30.74714989
MAD8.21887546
Skewness-0.1939405534
Sum4671230
Variance94.66651538
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 1. 2.5 4.5 6.5 7.5 ... 45.5 49.5 50.5 51.5 64. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
44 5460 3.6%
 
43 5217 3.4%
 
42 5143 3.4%
 
30 4984 3.3%
 
41 4979 3.3%
 
40 4963 3.3%
 
39 4960 3.3%
 
28 4953 3.3%
 
31 4944 3.3%
 
29 4939 3.3%
 
Other values (48) 101382 66.7%
 

Minimum 5 values

ValueCountFrequency (%) 
1 9 < 0.1%
 
2 26 < 0.1%
 
3 38 < 0.1%
 
4 52 < 0.1%
 
5 98 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
64 1 < 0.1%
 
61 1 < 0.1%
 
58 1 < 0.1%
 
56 1 < 0.1%
 
55 3 < 0.1%
 

df_index
Numeric

Distinct count116058
Unique (%)76.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean48562.69383
Minimum0
Maximum116057
Zeros (%)< 0.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile3798
Q118990
Median40095.5
Q378076.25
95-th percentile108460.85
Maximum116057
Range116057
Interquartile range59086.25

Descriptive statistics

Standard deviation34245.0233
Coef of variation0.7051714104
Kurtosis-1.142378432
Mean48562.69383
MAD29985.70903
Skewness0.3956263101
Sum7377838698
Variance1172721621
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 35865.5 116057. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 2 < 0.1%
 
26956 2 < 0.1%
 
22858 2 < 0.1%
 
16713 2 < 0.1%
 
18760 2 < 0.1%
 
12615 2 < 0.1%
 
14662 2 < 0.1%
 
8517 2 < 0.1%
 
10564 2 < 0.1%
 
4419 2 < 0.1%
 
Other values (116048) 151904 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2 < 0.1%
 
1 2 < 0.1%
 
2 2 < 0.1%
 
3 2 < 0.1%
 
4 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
116057 1 < 0.1%
 
116056 1 < 0.1%
 
116055 1 < 0.1%
 
116054 1 < 0.1%
 
116053 1 < 0.1%
 

financial_institution
Numeric

Distinct count19
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean6.508741213
Minimum0
Maximum18
Zeros (%)0.4%
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
Median8
Q38
95-th percentile15
Maximum18
Range18
Interquartile range7

Descriptive statistics

Standard deviation4.455992634
Coef of variation0.6846166545
Kurtosis-0.2453396206
Mean6.508741213
MAD3.584132731
Skewness0.4514386931
Sum988834
Variance19.85587035
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=19)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 14.5 15.5 16.5 17.5 18. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
8 64861 42.7%
 
1 41930 27.6%
 
15 8969 5.9%
 
4 6387 4.2%
 
5 6163 4.1%
 
6 4070 2.7%
 
7 2712 1.8%
 
18 2388 1.6%
 
14 2376 1.6%
 
3 2134 1.4%
 
Other values (9) 9934 6.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 635 0.4%
 
1 41930 27.6%
 
2 502 0.3%
 
3 2134 1.4%
 
4 6387 4.2%
 

Maximum 5 values

ValueCountFrequency (%) 
18 2388 1.6%
 
17 867 0.6%
 
16 1656 1.1%
 
15 8969 5.9%
 
14 2376 1.6%
 

first_payment_date
Numeric

Distinct count8
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.533253469
Minimum0
Maximum7
Zeros (%)0.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile1
Q11
Median2
Q33
95-th percentile6
Maximum7
Range7
Interquartile range2

Descriptive statistics

Standard deviation1.694876556
Coef of variation0.6690513117
Kurtosis0.4180710848
Mean2.533253469
MAD1.350807818
Skewness1.190135588
Sum384862
Variance2.872606541
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%) 
2 52840 34.8%
 
1 47680 31.4%
 
4 16551 10.9%
 
3 15014 9.9%
 
6 14661 9.7%
 
7 4510 3.0%
 
0 524 0.3%
 
5 144 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 524 0.3%
 
1 47680 31.4%
 
2 52840 34.8%
 
3 15014 9.9%
 
4 16551 10.9%
 

Maximum 5 values

ValueCountFrequency (%) 
7 4510 3.0%
 
6 14661 9.7%
 
5 144 0.1%
 
4 16551 10.9%
 
3 15014 9.9%
 

insurance_percent
Numeric

Distinct count14
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.772860114
Minimum0
Maximum40
Zeros (%)88.3%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile25
Maximum40
Range40
Interquartile range0

Descriptive statistics

Standard deviation8.080633809
Coef of variation2.914187329
Kurtosis5.923881439
Mean2.772860114
MAD4.899138106
Skewness2.753167499
Sum421264
Variance65.29664275
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=14)
Histogram
Histogram with variable size bins (bins=[ 0. 3. 9. 13.5 15.5 ... 23.5 27.5 32.5 37. 40. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 134211 88.3%
 
30 6690 4.4%
 
25 6331 4.2%
 
12 3249 2.1%
 
6 906 0.6%
 
35 483 0.3%
 
16 29 < 0.1%
 
18 14 < 0.1%
 
17 4 < 0.1%
 
20 3 < 0.1%
 
Other values (4) 4 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 134211 88.3%
 
6 906 0.6%
 
12 3249 2.1%
 
15 1 < 0.1%
 
16 29 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
40 1 < 0.1%
 
39 1 < 0.1%
 
35 483 0.3%
 
30 6690 4.4%
 
25 6331 4.2%
 

insurance_type
Boolean

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
151432
1
 
492
ValueCountFrequency (%) 
0 151432 99.7%
 
1 492 0.3%
 

interest_rate
Numeric

Distinct count1121
Unique (%)0.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.869878854
Minimum2.25
Maximum6.75
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum2.25
5-th percentile3.125
Q13.5
Median3.875
Q34.125
95-th percentile4.625
Maximum6.75
Range4.5
Interquartile range0.625

Descriptive statistics

Standard deviation0.4609075925
Coef of variation0.1191012975
Kurtosis0.1754651707
Mean3.869878854
MAD0.349192025
Skewness0.04916428942
Sum587927.475
Variance0.2124358088
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[2.25 2.5625 2.6575 2.7375 2.753 ... 5.6025 5.6575 5.72 5.8125 6.75 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3.875 27645 18.2%
 
4 18788 12.4%
 
3.25 12702 8.4%
 
3.75 10905 7.2%
 
4.125 10505 6.9%
 
4.25 9848 6.5%
 
4.375 9232 6.1%
 
3.375 8442 5.6%
 
4.5 5984 3.9%
 
3.5 5719 3.8%
 
Other values (1111) 32154 21.2%
 

Minimum 5 values

ValueCountFrequency (%) 
2.25 1 < 0.1%
 
2.375 8 < 0.1%
 
2.5 10 < 0.1%
 
2.625 27 < 0.1%
 
2.69 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6.75 1 < 0.1%
 
6.625 2 < 0.1%
 
6.5 4 < 0.1%
 
6.25 1 < 0.1%
 
6 4 < 0.1%
 

loan_id
Numeric

Distinct count151924
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean4.197103013e+11
Minimum1
Maximum9.999970752e+11
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1
5-th percentile7597.15
Q11.158882203e+11
Median4.103648985e+11
Q37.040455953e+11
95-th percentile9.41045715e+11
Maximum9.999970752e+11
Range9.999970752e+11
Interquartile range5.88157375e+11

Descriptive statistics

Standard deviation3.255504408e+11
Coef of variation0.7756551122
Kurtosis-1.304707718
Mean4.197103013e+11
MAD2.850546401e+11
Skewness0.1253282458
Sum6.376406781e+16
Variance1.059830895e+23
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1.00000000e+00 3.58655000e+04 1.00003610e+11 9.99997075e+11], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
7.242013771e+11 1 < 0.1%
 
3.205701052e+11 1 < 0.1%
 
4.211298154e+11 1 < 0.1%
 
9.733137421e+11 1 < 0.1%
 
2.127150667e+11 1 < 0.1%
 
3.461370177e+11 1 < 0.1%
 
7.18050975e+11 1 < 0.1%
 
5.373193844e+11 1 < 0.1%
 
6.476632221e+11 1 < 0.1%
 
Other values (151914) 151914 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
9.999970752e+11 1 < 0.1%
 
9.999731575e+11 1 < 0.1%
 
9.999506216e+11 1 < 0.1%
 
9.999279843e+11 1 < 0.1%
 
9.999180308e+11 1 < 0.1%
 

loan_purpose
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
76354
1
38580
2
36990
ValueCountFrequency (%) 
0 76354 50.3%
 
1 38580 25.4%
 
2 36990 24.3%
 
Max length1
Mean length1
Min length1
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

loan_term
Numeric

Distinct count149
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean292.4814578
Minimum60
Maximum360
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum60
5-th percentile120
Q1180
Median360
Q3360
95-th percentile360
Maximum360
Range300
Interquartile range180

Descriptive statistics

Standard deviation89.65361292
Coef of variation0.3065275098
Kurtosis-1.288368081
Mean292.4814578
MAD83.95940176
Skewness-0.6965707867
Sum44434953
Variance8037.77031
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 60. 83.5 84.5 95. 98. ... 337. 347.5 348.5 359.5 360. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
360 94127 62.0%
 
180 37330 24.6%
 
120 9158 6.0%
 
240 8879 5.8%
 
300 1194 0.8%
 
96 226 0.1%
 
156 124 0.1%
 
144 104 0.1%
 
336 92 0.1%
 
324 59 < 0.1%
 
Other values (139) 631 0.4%
 

Minimum 5 values

ValueCountFrequency (%) 
60 8 < 0.1%
 
71 2 < 0.1%
 
72 2 < 0.1%
 
76 1 < 0.1%
 
77 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
360 94127 62.0%
 
359 1 < 0.1%
 
358 1 < 0.1%
 
357 1 < 0.1%
 
355 4 < 0.1%
 

loan_to_value
Numeric

Distinct count93
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean67.42164503
Minimum5
Maximum97
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum5
5-th percentile33
Q157
Median72
Q380
95-th percentile92
Maximum97
Range92
Interquartile range23

Descriptive statistics

Standard deviation17.28106455
Coef of variation0.2563133032
Kurtosis0.08119119491
Mean67.42164503
MAD13.99575678
Skewness-0.7659145465
Sum10242966
Variance298.6351919
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 5. 9.5 13.5 15.5 18.5 ... 91.5 94.5 95.5 96.5 97. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
80 29301 19.3%
 
75 10744 7.1%
 
95 5698 3.8%
 
70 4458 2.9%
 
90 4274 2.8%
 
79 3606 2.4%
 
60 3346 2.2%
 
74 3278 2.2%
 
78 3039 2.0%
 
72 2922 1.9%
 
Other values (83) 81258 53.5%
 

Minimum 5 values

ValueCountFrequency (%) 
5 1 < 0.1%
 
6 6 < 0.1%
 
7 8 < 0.1%
 
8 13 < 0.1%
 
9 15 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
97 815 0.5%
 
96 59 < 0.1%
 
95 5698 3.8%
 
94 488 0.3%
 
93 470 0.3%
 

m1
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
151507
1
 
368
2
 
42
ValueCountFrequency (%) 
0 151507 99.7%
 
1 368 0.2%
 
2 42 < 0.1%
 
3 7 < 0.1%
 
Max length1
Mean length1
Min length1
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

m10
Numeric

Distinct count12
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.006365024618
Minimum0
Maximum12
Zeros (%)99.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum12
Range12
Interquartile range0

Descriptive statistics

Standard deviation0.1424967343
Coef of variation22.38746004
Kurtosis2019.149222
Mean0.006365024618
MAD0.01268203629
Skewness39.16526857
Sum967
Variance0.02030531928
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=12)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 4.5 8.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151351 99.6%
 
1 426 0.3%
 
2 63 < 0.1%
 
3 28 < 0.1%
 
4 18 < 0.1%
 
6 11 < 0.1%
 
5 9 < 0.1%
 
7 7 < 0.1%
 
8 5 < 0.1%
 
9 4 < 0.1%
 
Other values (2) 2 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151351 99.6%
 
1 426 0.3%
 
2 63 < 0.1%
 
3 28 < 0.1%
 
4 18 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
12 1 < 0.1%
 
11 1 < 0.1%
 
9 4 < 0.1%
 
8 5 < 0.1%
 
7 7 < 0.1%
 

m11
Numeric

Distinct count13
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.006885021458
Minimum0
Maximum13
Zeros (%)99.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum13
Range13
Interquartile range0

Descriptive statistics

Standard deviation0.1580873056
Coef of variation22.96104762
Kurtosis1919.391209
Mean0.006885021458
MAD0.01371828879
Skewness38.84844958
Sum1046
Variance0.02499159619
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=13)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 7.5 13. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151353 99.6%
 
1 410 0.3%
 
2 64 < 0.1%
 
3 29 < 0.1%
 
4 16 < 0.1%
 
5 15 < 0.1%
 
7 12 < 0.1%
 
6 10 < 0.1%
 
8 6 < 0.1%
 
9 4 < 0.1%
 
Other values (3) 5 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151353 99.6%
 
1 410 0.3%
 
2 64 < 0.1%
 
3 29 < 0.1%
 
4 16 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
13 1 < 0.1%
 
11 1 < 0.1%
 
10 3 < 0.1%
 
9 4 < 0.1%
 
8 6 < 0.1%
 

m12
Numeric

Distinct count13
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.007892103947
Minimum0
Maximum14
Zeros (%)99.6%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum14
Range14
Interquartile range0

Descriptive statistics

Standard deviation0.1741496753
Coef of variation22.06631799
Kurtosis1799.164329
Mean0.007892103947
MAD0.01571740314
Skewness37.68468478
Sum1199
Variance0.03032810942
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=13)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 3.5 7.5 10.5 14. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151281 99.6%
 
1 469 0.3%
 
2 58 < 0.1%
 
3 39 < 0.1%
 
6 18 < 0.1%
 
4 17 < 0.1%
 
7 11 < 0.1%
 
5 11 < 0.1%
 
8 7 < 0.1%
 
10 5 < 0.1%
 
Other values (3) 8 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151281 99.6%
 
1 469 0.3%
 
2 58 < 0.1%
 
3 39 < 0.1%
 
4 17 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
14 1 < 0.1%
 
11 3 < 0.1%
 
10 5 < 0.1%
 
9 4 < 0.1%
 
8 7 < 0.1%
 

m2
Numeric

Distinct count5
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.002119480793
Minimum0
Maximum4
Zeros (%)99.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum4
Range4
Interquartile range0

Descriptive statistics

Standard deviation0.0532827694
Coef of variation25.13953869
Kurtosis1471.553035
Mean0.002119480793
MAD0.004231093263
Skewness32.95069434
Sum322
Variance0.002839053515
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=5)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 4. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151642 99.8%
 
1 254 0.2%
 
2 19 < 0.1%
 
3 6 < 0.1%
 
4 3 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151642 99.8%
 
1 254 0.2%
 
2 19 < 0.1%
 
3 6 < 0.1%
 
4 3 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
4 3 < 0.1%
 
3 6 < 0.1%
 
2 19 < 0.1%
 
1 254 0.2%
 
0 151642 99.8%
 

m3
Numeric

Distinct count6
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.001968089308
Minimum0
Maximum5
Zeros (%)99.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum5
Range5
Interquartile range0

Descriptive statistics

Standard deviation0.05576330242
Coef of variation28.33372561
Kurtosis2608.491647
Mean0.001968089308
MAD0.003929753217
Skewness42.90379644
Sum299
Variance0.003109545897
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=6)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151676 99.8%
 
1 220 0.1%
 
2 15 < 0.1%
 
3 6 < 0.1%
 
4 4 < 0.1%
 
5 3 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151676 99.8%
 
1 220 0.1%
 
2 15 < 0.1%
 
3 6 < 0.1%
 
4 4 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
5 3 < 0.1%
 
4 4 < 0.1%
 
3 6 < 0.1%
 
2 15 < 0.1%
 
1 220 0.1%
 

m4
Numeric

Distinct count7
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.002139227508
Minimum0
Maximum6
Zeros (%)99.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range0

Descriptive statistics

Standard deviation0.06148362155
Coef of variation28.74103914
Kurtosis2919.835905
Mean0.002139227508
MAD0.004271245593
Skewness45.46357027
Sum325
Variance0.003780235718
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=7)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 3.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151668 99.8%
 
1 220 0.1%
 
2 18 < 0.1%
 
3 9 < 0.1%
 
4 5 < 0.1%
 
6 2 < 0.1%
 
5 2 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151668 99.8%
 
1 220 0.1%
 
2 18 < 0.1%
 
3 9 < 0.1%
 
4 5 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6 2 < 0.1%
 
5 2 < 0.1%
 
4 5 < 0.1%
 
3 9 < 0.1%
 
2 18 < 0.1%
 

m5
Numeric

Distinct count8
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.003337194913
Minimum0
Maximum7
Zeros (%)99.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum7
Range7
Interquartile range0

Descriptive statistics

Standard deviation0.0802054671
Coef of variation24.0337976
Kurtosis2224.819399
Mean0.003337194913
MAD0.00665760764
Skewness39.48756787
Sum507
Variance0.006432916953
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=8)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 3.5 7. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151542 99.7%
 
1 315 0.2%
 
2 39 < 0.1%
 
3 13 < 0.1%
 
4 6 < 0.1%
 
5 5 < 0.1%
 
7 2 < 0.1%
 
6 2 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151542 99.7%
 
1 315 0.2%
 
2 39 < 0.1%
 
3 13 < 0.1%
 
4 6 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
7 2 < 0.1%
 
6 2 < 0.1%
 
5 5 < 0.1%
 
4 6 < 0.1%
 
3 13 < 0.1%
 

m6
Numeric

Distinct count9
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.003172638951
Minimum0
Maximum8
Zeros (%)99.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum8
Range8
Interquartile range0

Descriptive statistics

Standard deviation0.08315344108
Coef of variation26.20955059
Kurtosis2445.116944
Mean0.003172638951
MAD0.006331202715
Skewness42.22134397
Sum482
Variance0.006914494764
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=9)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 3.5 4.5 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151587 99.8%
 
1 267 0.2%
 
2 31 < 0.1%
 
3 20 < 0.1%
 
4 10 < 0.1%
 
5 4 < 0.1%
 
6 3 < 0.1%
 
8 1 < 0.1%
 
7 1 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151587 99.8%
 
1 267 0.2%
 
2 31 < 0.1%
 
3 20 < 0.1%
 
4 10 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
8 1 < 0.1%
 
7 1 < 0.1%
 
6 3 < 0.1%
 
5 4 < 0.1%
 
4 10 < 0.1%
 

m7
Numeric

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.003975672047
Minimum0
Maximum9
Zeros (%)99.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum9
Range9
Interquartile range0

Descriptive statistics

Standard deviation0.09774881935
Coef of variation24.58674111
Kurtosis2397.358639
Mean0.003975672047
MAD0.007929676309
Skewness41.98089514
Sum604
Variance0.009554831685
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 5.5 9. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151510 99.7%
 
1 333 0.2%
 
2 35 < 0.1%
 
3 17 < 0.1%
 
4 12 < 0.1%
 
5 8 < 0.1%
 
6 4 < 0.1%
 
7 3 < 0.1%
 
9 1 < 0.1%
 
8 1 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151510 99.7%
 
1 333 0.2%
 
2 35 < 0.1%
 
3 17 < 0.1%
 
4 12 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
9 1 < 0.1%
 
8 1 < 0.1%
 
7 3 < 0.1%
 
6 4 < 0.1%
 
5 8 < 0.1%
 

m8
Numeric

Distinct count10
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.004554909033
Minimum0
Maximum10
Zeros (%)99.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum10
Range10
Interquartile range0

Descriptive statistics

Standard deviation0.1086931248
Coef of variation23.86285301
Kurtosis2201.81927
Mean0.004554909033
MAD0.009083074571
Skewness40.26684127
Sum692
Variance0.01181419537
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 6.5 10. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151478 99.7%
 
1 335 0.2%
 
2 59 < 0.1%
 
3 15 < 0.1%
 
4 14 < 0.1%
 
5 11 < 0.1%
 
6 6 < 0.1%
 
7 4 < 0.1%
 
10 1 < 0.1%
 
9 1 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151478 99.7%
 
1 335 0.2%
 
2 59 < 0.1%
 
3 15 < 0.1%
 
4 14 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
10 1 < 0.1%
 
9 1 < 0.1%
 
7 4 < 0.1%
 
6 6 < 0.1%
 
5 11 < 0.1%
 

m9
Numeric

Distinct count11
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.005114399305
Minimum0
Maximum11
Zeros (%)99.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30
95-th percentile0
Maximum11
Range11
Interquartile range0

Descriptive statistics

Standard deviation0.1221025806
Coef of variation23.874276
Kurtosis2252.748849
Mean0.005114399305
MAD0.01019668297
Skewness41.08384865
Sum777
Variance0.01490904018
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=11)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 3.5 7.5 11. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 151447 99.7%
 
1 358 0.2%
 
2 49 < 0.1%
 
3 30 < 0.1%
 
5 11 < 0.1%
 
4 10 < 0.1%
 
6 8 < 0.1%
 
7 5 < 0.1%
 
8 4 < 0.1%
 
11 1 < 0.1%
 

Minimum 5 values

ValueCountFrequency (%) 
0 151447 99.7%
 
1 358 0.2%
 
2 49 < 0.1%
 
3 30 < 0.1%
 
4 10 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
11 1 < 0.1%
 
10 1 < 0.1%
 
8 4 < 0.1%
 
7 5 < 0.1%
 
6 8 < 0.1%
 

number_of_borrowers
Highly correlated

This variable is highly correlated with co-borrower_credit_score and should be ignored for analysis

Correlation0.9965738849

origination_date
Numeric

Distinct count6
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean2.99476712
Minimum0
Maximum5
Zeros (%)9.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q13
Median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range1

Descriptive statistics

Standard deviation1.443126178
Coef of variation0.4818826038
Kurtosis-0.3214225579
Mean2.99476712
MAD1.082163681
Skewness-0.80553738
Sum454977
Variance2.082613165
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%) 
4 52334 34.4%
 
3 49093 32.3%
 
1 16423 10.8%
 
0 15051 9.9%
 
5 14631 9.6%
 
2 4392 2.9%
 

Minimum 5 values

ValueCountFrequency (%) 
0 15051 9.9%
 
1 16423 10.8%
 
2 4392 2.9%
 
3 49093 32.3%
 
4 52334 34.4%
 

Maximum 5 values

ValueCountFrequency (%) 
5 14631 9.6%
 
4 52334 34.4%
 
3 49093 32.3%
 
2 4392 2.9%
 
1 16423 10.8%
 

source
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
0
83572
1
49207
2
19145
ValueCountFrequency (%) 
0 83572 55.0%
 
1 49207 32.4%
 
2 19145 12.6%
 
Max length1
Mean length1
Min length1
Contains charsFalse
Contains digitsTrue
Contains spacesFalse
Contains non-wordsFalse

unpaid_principal_bal
Numeric

Distinct count660
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean208117.3021
Minimum11000
Maximum1200000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum11000
5-th percentile65000
Q1120000
Median182000
Q3278000
95-th percentile417000
Maximum1200000
Range1189000
Interquartile range158000

Descriptive statistics

Standard deviation114655.7805
Coef of variation0.5509190219
Kurtosis0.5661005638
Mean208117.3021
MAD92682.3322
Skewness0.9004530763
Sum3.1618013e+10
Variance1.3145948e+10
Memory size1.2 MiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 11000. 17500. 23500. 29500. 30500. ... 624500. 628000. 724000. 858500. 1200000.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
417000 3530 2.3%
 
100000 1847 1.2%
 
200000 1726 1.1%
 
150000 1587 1.0%
 
120000 1210 0.8%
 
140000 1137 0.7%
 
160000 1109 0.7%
 
300000 1106 0.7%
 
180000 1097 0.7%
 
125000 990 0.7%
 
Other values (650) 136585 89.9%
 

Minimum 5 values

ValueCountFrequency (%) 
11000 1 < 0.1%
 
14000 5 < 0.1%
 
15000 5 < 0.1%
 
16000 4 < 0.1%
 
17000 2 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1200000 1 < 0.1%
 
968000 1 < 0.1%
 
915000 1 < 0.1%
 
802000 4 < 0.1%
 
800000 2 < 0.1%
 

Correlations

Missing values

Sample

First rows

borrower_credit_scoreco-borrower_credit_scoredebt_to_income_ratiodf_indexfinancial_institutionfirst_payment_dateinsurance_percentinsurance_typeinterest_rateloan_idloan_purposeloan_termloan_to_valuem1m10m11m12m2m3m4m5m6m7m8m9number_of_borrowersorigination_datesourceunpaid_principal_bal
0694.00.022.0018330.004.2502680550086192360950000000001001.052214000
1697.00.044.011510.004.8756728316576271360720010000000001.031144000
2780.00.033.021710.003.2507425152421081180490000000000001.032366000
3633.0638.044.03820.004.7506013856674621360460111000000012.040135000
4681.00.043.04820.004.750273870029961236080091011123456781.040124000
5675.00.046.05120.004.3757690600244642360801000000000001.041150000
6723.00.044.068230.004.0001480716146872360950000000000001.04059000
7652.00.045.07110.004.5008533839532660300620000100000001.031319000
8808.00.035.08130.004.0004235900723352360760201000101011.050520000
9702.0700.041.098130.004.0003089908468160360950122000011112.030214000

Last rows

borrower_credit_scoreco-borrower_credit_scoredebt_to_income_ratiodf_indexfinancial_institutionfirst_payment_dateinsurance_percentinsurance_typeinterest_rateloan_idloan_purposeloan_termloan_to_valuem1m10m11m12m2m3m4m5m6m7m8m9number_of_borrowersorigination_datesourceunpaid_principal_bal
151914803.0799.038.035856860.004.250358572360730000000000002.000240000
151915776.00.022.035857170.004.000358581360800000000000001.010304000
151916764.00.044.035858140.004.125358590180640000000000001.01079000
151917698.00.045.035859860.004.000358601360640000001000001.000237000
151918791.0781.024.035860140.003.250358610180800000000000002.011226000
151919684.0712.030.035861840.004.125358622240800000000000002.000232000
151920812.00.030.035862460.003.375358631180800000000000001.002204000
151921624.0646.038.035863140.004.250358641360520000000000012.010200000
151922753.00.034.035864440.004.375358650360660000000000001.011400000
1519230.00.03.035865840.004.375358662360700000000000001.010182000